Phylogenetic signals in DNA composition: limitations and prospects.
Abstract
The concept of a genome signature allows sequence comparisons without alignment. It relies on the premise that oligonucleotide compositions of DNA segments from the same or closely related genomes tend to be more similar than those from distantly related genomes. This concept has been used in the detection of lateral gene transfer, in the phylogenetic classification of metagenome sequences (binning), and in studies of the evolution of viruses and plasmids. The goal of this work is to explore the limitations of genome signatures in the phylogenetic classification of DNA sequences and to identify formal representations of the genome signature that best expose the phylogenetic relationships among prokaryotes. We found that the genome signatures that best represent phylogenetic relationships are those normalized to factor out differences in G + C content and that use either the standard A-C-G-T alphabet or the degenerate R-Y (purine-pyrimidine) alphabet. The main limitation of all genome signature representations tested is a lack of divergence among some distantly related species. "Crowding" of the genome signature space and the absence of a molecular clock likely contribute to this phenomenon. We introduce "periodicity signatures," formal representations of periodic sequence patterns related to DNA curvature, which can discriminate between bacterial and archaeal DNA sequences. Interestingly, archaea of the order Halobacteriales have periodicity signatures similar to those of bacteria, possibly because of their early divergence from other archaea, extensive lateral gene transfer, or adaptation to high-salt environments. Our results have practical implications for the development and application of genome signature-based methods for the analysis and classification of DNA sequences.
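To make the idea of a composition-normalized signature concrete, the following is a minimal Python sketch. It is not the paper's implementation: the abstract does not specify the normalization, so the sketch assumes one common choice, the ratio of observed oligonucleotide frequency to the frequency expected from single-letter frequencies, computed over both strands. The function names (`dna_signature`, `ry_signature`, `signature_distance`), the default word length `k=2`, and the mean-absolute-difference distance are all illustrative assumptions.

```python
from collections import Counter
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")
# Degenerate alphabet: purines (A, G) -> R, pyrimidines (C, T) -> Y.
PURINE_PYRIMIDINE = str.maketrans("ACGT", "RYRY")


def reverse_complement(seq):
    """Reverse complement; characters other than A, C, G, T are left as-is."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def both_strands(seq):
    """Join a sequence and its reverse complement with an 'N' separator,
    so that no counted word spans the junction between the two strands."""
    return seq.upper() + "N" + reverse_complement(seq)


def odds_ratio_signature(seq, k, alphabet):
    """Observed/expected frequency for every k-letter word over `alphabet`.

    The expected frequency is the product of single-letter frequencies
    (an assumed normalization that factors out base composition)."""
    letters = Counter(ch for ch in seq if ch in alphabet)
    words = Counter()
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if all(ch in alphabet for ch in word):
            words[word] += 1
    total_letters = sum(letters.values())
    total_words = sum(words.values())
    if total_words == 0:
        raise ValueError("sequence too short or contains no valid words")
    signature = {}
    for word in map("".join, product(alphabet, repeat=k)):
        expected = 1.0
        for ch in word:
            expected *= letters[ch] / total_letters
        observed = words[word] / total_words
        signature[word] = observed / expected if expected > 0 else 0.0
    return signature


def dna_signature(seq, k=2):
    """Signature over the standard A-C-G-T alphabet."""
    return odds_ratio_signature(both_strands(seq), k, "ACGT")


def ry_signature(seq, k=2):
    """Signature over the degenerate R-Y (purine-pyrimidine) alphabet."""
    reduced = both_strands(seq).translate(PURINE_PYRIMIDINE)
    return odds_ratio_signature(reduced, k, "RY")


def signature_distance(sig_a, sig_b):
    """Mean absolute difference between two signatures with the same keys."""
    return sum(abs(sig_a[w] - sig_b[w]) for w in sig_a) / len(sig_a)
```

With these helpers, two DNA fragments could be compared by calling `signature_distance(dna_signature(a), dna_signature(b))`; a smaller value means more similar composition under the assumed normalization. Counting both strands makes the signature independent of which strand was sequenced.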
Related articles
Phylogenetic relationships of the commercial marine shrimp family Penaeidae from Persian Gulf
Phylogenetic relationships among all described species (a total of 5 taxa) of the shrimp genus Penaeus were examined with nucleotide sequence data from portions of mitochondrial gene and cytochrome oxidase subunit I (COI). There are twelve commercial shrimp species in Iranian coastal waters. The reconstruction of the evolutionary phylogeny of these species is crucial in revealing stock identity that ca...
Molecular phylogeny of the genus Eumeces Wiegmann, 1834 (Reptilia: Scincidae) in Iran, based on the mitochondrial 16S gene
Phylogenetic relationships between Eumeces schneiderii princeps and Eumeces schneiderii pavimentatus were investigated using 509 bp partial sequences of the 16S mitochondrial gene. Analyses were done under the maximum-likelihood criterion (RAxML) on 52 specimens from over 20 geographically distinct localities. Our molecular results proposed two well-supported major clades by their phylogenetic positions, gene...
A comparative phylogenetic analysis of Theileria spp. by using two "18S ribosomal RNA" and "Theileria annulata merozoite surface antigen" gene sequences
More than 185 species, strains and unclassified Theileria parasites are categorized in the Entrez Taxonomy. The accurate diagnosis and proper identification of the causative agents are important for understanding the epidemiology, prevention and appropriate treatment. This study aims to discuss the importance of two genes of Theileria annulata 18S ribosomal RNA (18S rRNA) and Theileria annulata...
A preliminary study on phylogenetic relationship between five sturgeon species in the Iranian Coastline of the Caspian Sea
The phylogenetic relationship of five sturgeon species in the South Caspian Sea was investigated using mitochondrial DNA (mtDNA). Sequence analysis of mtDNA D-loop region of five sturgeon species [Great sturgeon (Huso huso), Russian sturgeon (Acipenser gueldenstaedtii), Persian sturgeon (Acipenser persicus), Ship sturgeon (Acipenser nudiventris), Stellate sturgeon (Acipenser stellatus)] and DNA sequencing...
Journal:
- Molecular Biology and Evolution
Volume 26, Issue 5
Pages: -
Publication date: 2009